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A FORMULATION AND ANALYSIS OF COMBAT GAMES 
Michael Heymann,* Mark D. Ardema, and Narayanaswami Rajanf 

Ames Research Center 


SUMMARY 


Most investigations which use the theory of differential games to analyze 
combat problems have focused on deterministic, two-person, zero-sum, perfect- 
information pursuit-evasion games. This framework is quite suitable when the 
pursuer-evader roles are well defined by the nature of the problem and the evader 
has no offensive capability with which to threaten the pursuer to affect the 
outcome. The formulation is, however, inadequate to model combat between two (or 
more) opponents when both (or all) have offensive capabilities and objectives. An 
obvious example of such a situation is air-to-air combat. The few attempts to 
analyze this more general combat problem have used either concepts of role- 
determination or, more recently, that of a two-target differential game. Neither of 
these approaches, however, has led to a complete and consistent conceptual defini- 
tion and corresponding mathematical theory of combat games. It is our purpose in 
this paper to formulate and illustrate such a theory. 

We begin with a discussion of the qualitative features of combat games between 
two aggressive opponents; this discussion indicates the rich variety of behavior 
present in such games and makes clear the inadequacy of the pursuit-evasion assump- 
tion, with or without role determination, for modeling combat. We then propose a 
mathematical formulation of deterministic combat games between two opponents with 
offensive capabilities and offensive objectives. Resolution of the combat essen- 
tially involves solving two differential games with state constraints. Depending on 
the game dynamics and parameters, the combat can terminate in one of four ways. 

(1) the first player wins, (2) the second player wins, (3) a draw (neither wins), or 
(4) joint capture. In the first two cases, the optimal strategies of the two 
players are determined from suitable zero-sum games; whereas, in the latter two. 
cases, the relevant game is nonzero-sum. Next, to avoid certain technical difficul- 
ties, the concept of a 6-combat game is introduced. 

To illustrate the definition, formulation, and solution of combat games, an 
example, called the turret game , is analyzed in detail. This game may be thought of 
as a highly simplified model of air combat, yet it is sufficiently complex to 
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exhibit a rich variety of* combat behavior, much of which is not found in pure 
pursuit-evasion games. 


1 . INTRODUCTION 


By a game of combat we intuitively refer to an encounter between two hostile 
adversaries, or players , each of whom wishes to destroy or capture the other, while, 
if possible, ensuring his own survival. A player who succeeds in capturing his 
opponent is said to win the game; if he can win, he will try to win with as little 
cost to himself as possible. If he is unable to do so, he will try to make his 
opponent's win as difficult or as costly as possible. 

A conspicuous example, and one of the main incentives for investigating games 
of combat, is the aerial combat problem in which there is a duel between two (or 
even more) maneuvering aircraft. Various situations can be visualized; for example, 
a missile in pursuit of a plane, a fighter in pursuit of a bomber, a duel between 
two fighter aircraft. 


In the simplest manifestation of a combat game, one of the players has no 
offensive capabilities so that he can never win in the above sense. Thus, the 
offensive player becomes the pursuer . and the inoffensive opponent becomes the 
evader. The resultant pursuit-evasion problem becomes what is sometimes called a 
g^j” e survival (see e.g., ref. 1). The evader attempts to avoid capture or, if 
this is not possible, to maximize the pursuer's cost of attaining his goal; the 
pursuer endeavors to capture, and at minimum cost. The special (but important) case 
in which the cost functional is the time to capture, has sometimes been referred to 
as a game of pursuit-evasion (ref. 1). 


The early studies of combat problems focused almost entirely on the above- 
mentioned framework in which the two players have the clearly defined (and oppos- 
ing) roles of pursuer and evader, or minimizer and maximizer. Immediate applica- 
tions of those studies are in such problems as missile versus aircraft or fighter 
versus bomber . 


The generally accepted mathematical framework for formulating and solving 
pursuit-evasion problems is the theory of differential games according to Isaacs 
(ref. 2). Specifically, the game consists of a dynamical system whose state transi- 
tion is governed by a set of n ordinary differential equations 


dx 

dt 


f(t,x,u,v) 


( 1 ) 


where x = x(t)e is the state, u = u(t) eUc* m and v = v(t) e VcfP are 
the two players' controls, and x(t Q ) = x Q is the initial state. Associated with 
the game is a cost functional 


J(u,v,x o ,t Q ) = g[x(t),t] + I h[t,x(t) ,u(t) ,v(t) ]dt , 


c 


( 2 ) 
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where t(o) is a free or fixed termination time. Typically, player u, the pur- 
suer, attempts to minimize the cost J, whereas v, the evader, tries to maximize 
it. (Throughout the paper we do not make a notational distinction between the 
players and their controls, the meaning being clear from the context.) Various 
additional assumptions and constraints can be imposed on the problem to suit spe- 
cific requirements. 

In games of survival and of pursuit-evasion, the termination time is free and 
is determined by a capture condition ; that is, a state constraint that is imposed on 
the problem as follows: 


x(t) <2 & cM n v t < t < t (3) 

o 

and 

x(t) G S' . (4) 

Here & is the target or capture set and t, defined through equations (3) and (4), 
is the capture time. The game of pursuit-evasion is thus the special case of the 
above framework in which g = 0 and h = 1 , with the cost being the time to cap- 
ture, (t - t ). It is then natural to refer to the minimizing player u as the 
pursuer and to the maximizing player v as the evader. 

Much research has been done on games of survival and games of pursuit-evasion 
within the general framework of differential game theory (see, e.g., 
refs. 1, 3-5). The early use of differential games in the modeling and analysis of 
aerial combat problems is reviewed in reference 6; a more recent review may be found 
in chapter 8 of reference 7. Although highly idealized, widely investigated models 
for pursuit-evasion analysis are the homicidal chauffeur game (refs. 2,8,9), and its 
generalization, the game of two cars. The latter was used as a model for aerial 
combat analysis in a variety of studies; see, for example, references 10-15 as well 
as the general survey article, reference 16. More recently, various generalizations 
of the game of two cars were used for analysis of aerial pursuit-evasion to accommo- 
date variable speed and other aircraft capabilities (refs. 17-20). Also, special 
techniques were developed and examined to facilitate computation and to alleviate 
some of the difficulties associated with high dimensionality. 

It was realized even in the early stages (ref. 6), however, that the pursuer- 
evader model is inadequate for a situation such as fighter versus fighter combat, in 
which there is no justification for an arbitrary a priori role assignment of pursuer 
and evader. This difficulty led to much confusion in efforts to reconcile the 
differential game methodology with intuition, based on the perceived experiences in 
actual combat situations. For example, it was stated in reference 6 that a multiple 
criterion may be required to formulate these aerial combat encounters correctly and 
that role reversals during a given encounter typically occur. In other studies, the 
idea was promoted that the central issue is that of role determination 
(refs. 7,13,21); that is, deciding which of the two players should assume the role 
of pursuer and which that of evader. While various approaches to the role 
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determination problem were proposed, it was generally believed that the winning 
player should always assume the role of pursuer while his opponent should assume 
that of evader. 

Realizing that the existing approaches for determining the potential winner of 
a combat game were unsatisfactory, Getz and Pachter (refs. 22 and 23) adopted the 
concept of a two target game that was introduced earlier to the differential game 
literature by Blaquiere et al. (ref. 24) and Getz and Leitmann (ref. 25). In this 
setting, each player has a target set and attempts to drive the state into his 
target set without being first driven to the target set of his opponent. A further 
recent study that combines the role assignment point of view with the two-target 
idea has been reported in reference 26. 

The work of Getz and Pachter, just as in most past studies of pursuit-evasion 
differential games, has been confined to the problem of capturability ; that is, to 
the "game of kind" (in Isaac's terminology). The approach in such studies has been 
based on Isaac's technique of investigating certain semipermeable surfaces to deter- 
mine barriers and other singular surfaces (ref. 2). This approach has contributed a 
great deal to the understanding of differential games; however, it has two serious 
limitations for application to combat problems. First, although a barrier analysis 
is plausible in very low dimensional problems, it becomes rapidly infeasible as the 
dimensionality increases, especially in the two-target case where added complexity 
arises. Indeed, Getz and Pachter made some major simplifications in their target- 
set geometries to overcome essential difficulties with dimensionality and render the 
analysis tractable in their investigations of the two-target homicidal chauffeur 
game (ref. 22) and the two-target game of two cars (ref. 23). The second limitation 
of barrier analysis is that it does not address the fundamental problem of deter- 
mination of strategies; in fact, the player's actual optimal (or, at least, winning) 
strategies in combat situations remain obscure and unexplored. 

In the present paper, we formulate and examine the combat problem for an arbi- 
trary dynamical system from a strategy-analysis point of view. 


2. QUALITATIVE FEATURES OF COMBAT GAMES 


In an ordinary pursuit-evasion problem, the termination of the game (i.e., 
capture) is determined, as we have seen earlier, by the capture condition specified 
by equations (3) and (4) (which can also be stated in terms of equality or inequal- 
ity constraints). The capturability issue constitutes the "game of kind" (ref. 2), 
and for a given initial state the problem is whether the pursuer can actually force 
capture or termination. The question of strategy (that is, how to accomplish cap- 
ture when possible or how to prevent or delay it) constitutes the game of degree 
(ref. 2). 

In the two-target combat model with targets, say, and & v , the termination 
condition is 
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x(t) <2 S: - S u u S^ V t Q < t < t , (5) 

and _ 

x(t) G S , (6) 

where S u is the target associated with player u, and S v is the target associ- 
ated with v. If x(t) € S u but x(t) 2_f v for all t < t < t, we say that _ 
player u wins the game; whereas, if x(t) e S^ and x(t) 2 S u for t Q < t < t, 
we say that v wins. If S^ n s * 4> and 

x(t) e n , (7) 

we say that the game ends in joint capture . Finally, if T*(>t Q ) is the maximum 
allowable termination time of the game, and if x(t) 2 S for all t Q < t < T*, we 
say that the game ends in a tie or draw . 

In the pursuit-evasion game, the pursuing player wishes to lead the game to 
termination and, if he can, do so as quickly as possible. The evading player 
attempts to prevent termination or to delay it if prevention is impossible. In 
contrast, in the two-target problem the players' objectives are more complicated. 

In principle, both want to terminate the game but in different parts of S. 

Player u wants it to terminate in SIS v (i.e., in S excluding S v ) , and player 
v wants it to terminate in S /& u . 

To see how these conflicting objectives affect the players' strategies and to 
gain some insight into the actual situation, let us examine, qualitatively, a number 
of possible cases. Suppose that the capability of each player to evade his oppo- 
nent's target is independent of, and decoupled from, his capability to pursue his 
opponent. (Air combat between two aircraft with actively guided air-to-air missiles 
is an example.) Each player would then play two simultaneous and independent 
pursuit-evasion games: one is an offensive game in which he would try to capture 

his opponent, the other a defensive game in which he would try to evade his oppo- 
nent's weapons. The pursuit-evasion game that terminates first would determine the 
winner. 


Suppose now that with each of the above mentioned pursuit-evasion games we 
associate a cost functional 


Jj_ = g i [x(t.),t i ] 



h^tjxCt) ,u(t) ,v(t) ]dt 


i = u,v , 


where t. is the termination time of game i. In game u, player u wishes to 
minimize^ and player v wishes to maximize J u ; in game v, player v is the 
minimizer and u the maximizer. Suppose that, if the competition between the two 
games is ignored, it develops that t is greater than t as optimal termination 
times. By this optimality criterion it would be concluded, quite possibly erro- 
neously, that player v is the winner of the combat. Thus, even in this elementary 
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example, we are forced to add a constraint to each of the two games to account for 
the existence of the other, that is, for i = u,v, x(t) g &/ST. for all t < t < t. 
(where ^ denotes the target set of the game i). This constraint introduces a 1 
coupling between the two competing games that affects the players' strategies. A 
particularly interesting and important cost criterion is obtained when g^ = 0 
and = J.’ i = u > v » that is, when both games assume a time-optimality criterion. 

Let = t^, i = u,v be the optimal times obtained in the two pursuit games where, 

in each game,_the pursuer is the minimizer and the evader is the maximizer. 

Clearly, if t y < t then in game v the constraint x(t) g Pf? for all 
fc 0 - < fc v is violated and the constrained game v has no feasible solutions. 

Obviously u is the winning player. 

In the more general case, the players cannot perform their evasive maneuvers, 
that is, they cannot stay out of range of their opponents' weapon envelopes (capture 
sets), independently of their offensive maneuvers to capture the opponent. Indeed, 
typically, there is a trade-off between the two objectives of survival and capture 
of the opponent, and the players have to play their strategies accordingly. 

To illustrate the situations that might occur, consider two vehicles maneuver- 
ing in a horizontal plane (fig. 1). The arrows describe the vehicles' instantaneous 
headings, the cones the instantaneous envelopes of their weapons (fixed with respect 
to their headings) and the vertices of the cones are their instantaneous 
positions. We assume the typical situation that each player's maximum turn rate and 
speed are mutually dependent; specifically, the faster they move the slower they are 
able to turn, and conversely. 

Suppose that player v initially is in a vulnerable position (see fig. 1) such 
that by a slight turn of player u, v might enter u's weapon envelope, and 



Figure 1.- Maneuvering vehicles. 
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further suppose that v is more maneuverable in terms of turn rate and speed than 
u. If v adopts a pure evasive maneuver and u pursues, two outcomes are pos- 
sible: either v gets captured quickly or he evades successfully. If he can avoid 
capture initially, then in due time presumably he can capture u by virtue of his 
superior maneuverability. In the other case, even though v cannot avoid initial 
capture by u if he adopts pure (inoffensive) evasion, he may still be able to win 
the game by performing the offensive maneuver of turning his own target at u, 
thereby capturing u before u captures him. More generally, v may have to 
perform a composite (offensive-defensive) maneuver, wherein he turns his target at 
u while moving away from u's target just enough to avoid being captured him- 
self. Thus, player v might be able to win the game by a composite strategy even 
though he would lose it by playing pure evasion. 

It is also of interest to examine the optimal play of player u under the 
assumption that v can win with an optimal composite strategy. In spite of his 
eventual capture, u should adopt an offensive behavior because when u turns his 
weapon at v, v's offensive move is slowed down in order to survive u's threat. 

At least, thereby, u's capture is delayed. In this situation, roles of pursuer and 
evader cannot usefully be assigned to the two players. 

Thus it is easy to understand that, in general, analysis of pure pursuit- 
evasion problems, with or without role-determination analyses, reveals little, if 
any, information about the possible outcomes and optimal strategies of a combat 
game. In fact, it may be expected that misleading conclusions frequently will be 
drawn. It is clear that a new and fundamentally different approach to the problem 
is required. 


3. FORMULATION OF COMBAT GAMES 


Consider a system described by a set of n ordinary differential equations, 

= f(t,x,u,v) , (8) 

with initial time t Q and initial state x(t Q ) = x Q . The controls of the two 
players are measurable functions taking values in compact subsets U c and 
VcfP, respectively. 

<| 

Associated with the combat problem are two subsets S' and in St , the 
(extended) targets or (extended) capture sets of the players. We will assume that 
& and & are the closures of some open subsets in # n+1 and that there exists a 
time T» V > t 0 such that for all x e St n and all t > T*, (t,x) e n 

Combat starts at t = t Q and continues as long as 

[t,x(t) ] £ int & 
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Where 9~\ = 9~^ u 9"^ is the combat's (extended terminal set and where int( • ) 

denotes interior. We shall say that the combat terminates at time t where 

t: = inf {t > t Q |[t,x(t)] g int 9} . ( 9 ) 

If t = T#, we say that the combat ends in a draw . If t < T#, we say that player 

u (respectively^ player v) wins the combat if there exists an e >_0 such that 

[ t » x (_t ) ] g int 9"^ (respectively, [t,x(t)] g int 9 ) for all t > t satisfying 
t - t < e. If both players win the combat we speak V of Joint , or simultaneous . 
capture . Thus, the combat can terminate in one of the following four ways: (1) a 
win for player u; (2) a win for player v; (3) a draw; and (4) a joint capture. 

To obtain a consistent formulation of the combat problem, it is necessary first 
to resolve its decidability question. That is, each initial event (t Q , x Q ) must be 
uniquely and unambiguously classifiable into one of the four termination categories 

(1) -(4) above, thus partitioning the event space 9lx9l n into mutually exclusive 
regions $ u , $ v , <P uVv , and $ uAv , respectively. 

To this end we define the players' termination preferences as follows. 

Played, 4 ii ranks his preferences in order of priority as (1), (3), (4), (2), and 

player v ranks his preferences as (2), (3), (4), and (1). 

Remark 1 - This ranking is consistent with the intuitive notion that each player 
wishes to capture his opponent while not being captured himself. It also resolves 
the ambiguity that might occur in deciding between outcomes (3) and (4) when (1) and 

(2) cannot be forced by either of the players. This last point becomes clear if we 

observe that outcomes (3) and (4) can occur essentially in one of two ways: the 

players may be "locked into joint capture," in the sense that a unilateral attempt 
by one of the players to postpone termination will enable his opponent to win; on 
the other hand, if a player cannot force a win but has control over the time at 
which joint capture will occur, he will select the latest such time and, if pos- 
sible, set it at T* (i.e., a draw). □ 

It is readily noted that by definition, the regions $ u , $ v , 4> uVv , and $ 
are invariant in the sense that there exist strategies for the players that maintain 
the resultant trajectory in its initial region until combat termination. Moreover, 
any sensible or, as we shall say, consistent strategies by the players will satisfy 
this invariance. Indeed, a trajectory will leave its initial region only if at 
least one of the players makes a fatal strategy error, in which case we say that the 
game strategies are inconsistent. 

Remark 2 - It is important to emphasize that, in properly formulated and cor- 
rectly played combat, the winning capability of a player depends only on the problem 
data (including the initial state). No reversals of the winning capability (or 
"role") occur unless a fundamental error has been made by a player who relinquishes 
an advantage to his opponent. □ 
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We now associate with the combat problem a pair of differential games, one from 
the point of view of player u, or the u-game , and one from the point of view of 
player v, or the v-game . 

The u-game G u is defined as follows. Given is a cost functional 

/ t u 

h u [t,x(t),u(t),v(t)]dt , (10) 

t 

o 

with player u defined as the minimizer and player v as the maximizer. The 
terminal time t is specified by 

t = inf{t > t Q |[t,x(t)] G int ^ u ) , (11) 

subject to the event constraint 

[ t , x ( t ) ] £ int 9 V t < t < t, . (12) 

' v o u 

The v-game G y is defined as follows. Given is a cost functional 

J t v 

h v [t,x(t),u(t),v(t)]dt , (13) 

c 

o 

with player v defined as the minimizer and player u as the maximizer. The 
terminal time t is specified by 

t = inf{t > t Q |[t,x(t)] G int , (14) 

subject to the event constraint 

[ t , x( t ) ] g int V t Q < t < t v . (15) 

i 

We examine now the role of the two differential games in the formulation, of the 
combat problem. First, note that if (t Q ,x 0 ) G $ u u « v exactly one of the games Has 
feasible solutions (satisfying the terminal condition and event constraint) so that 
only the game of the winning player (the one with feasible solutions) can be 
played. The winning player will then choose his strategy to minimize his own cost 
functional (subject to the terminal and event constraints of the game); his oppo- 
nent, realizing that he has no alternatives (having no feasible solutions to his own 
game), will play to maximize his opponent's cost. Thus a zero-sum game results with 
the winning player the minimizer and his opponent the maximizer. 

In case (t^Xg) G $ uVv u $ uAv , both games have feasible strategies, and the 
terminal times t and t coincide (to the least time in which each player can 
force capture of His opponent or a draw; see also Remark 1). In this case each 
player will ..choose to minimize his own cost functional (while ignoring his 
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opponent's). The resultant game is a nonzero-sum game with event and terminal 
constraints. 

In summary, each player will choose his strategy to minimize the cost func- 
tional of his own game, unless for the given initial conditions his game has no 
feasible solution (that is, he is forced to lose the combat). In that case he will 
choose his strategy to maximize the cost of his opponent. 

Definition 1 - A combat problem formulated with the aid of dual differential 
games, and with strategy selections as described in table 1, is called a combat 
game . □ 


TABLE 1.- STRATEGY SELECTION RULES 
IN COMBAT GAMES 


Region 

$ 

u 

$ 

V 

$ uVv U $ uAv 

Strategy of 

u 

min J u 

max J v 

min J u 

Strategy of 

V 

max J u 

min J v 

min J y 


Remark 3 - Within the dual differential games framework proposed in the present 
paper, rules for strategy selections other than the one described above may be 
chosen. For example, we might have decided to select the players' strategies to 
maximize their opponents' costs instead of minimizing their own costs when 
(t 0 ,x 0 ) ^ $ uVv u $ uAv Although this is logically consistent, we prefer the setup 
as proposed above since we find it more in line with expected intuitive response. 
From a purely mathematical standpoint it makes no essential difference what strategy 
selection rule is chosen so long as it is consistent and decidable. □ 

In games of combat in general, the terminal time is strongly influenced by the 
competing nature of the two differential games G u and G y . When (t 0 ,x Q ) e $ uAv , the 
terminal time is forced to be the least time in which the players can, respectively, 
secure termination of their game, and the times for the two games coincide. Conse- 
quently, a case of special interest and simplicity (in terms of strategy selection), 
and one that is also of practical importance, is when g u = g v = 0 and 
h u = h v = 1; that is, the cost functionals of both games are the durations of the 
games. Thus in the time-optimal case, for (t Q ,x 0 ) e $ uVv u $ uAy essentially every 
feasible strategy is optimal. 


4. FORMULATION OF 6-COMBAT GAMES 


With certain types of cost functionals, the combat game as formulated in sec- 
tion 3 may not have optimal strategies because the space of admissible trajectories 
is not closed. Specifically, because the target sets and are closed, their 
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(nonempty) intersection S n S is also closed. Hence, the set __ 

S*\ = S /(S n S ), the complement of n S in S^, and the set S* (similarly 
defined 1 /, are no? closed. As a result, a convergent sequence of winning trajec- 
tories for one of the players, say for player u, that terminates in 0"* need not 
converge to a trajectory that terminates at a point in but, rather, in 

S n g . However, then the limiting trajectory ends in joint capture and not in a 
win for u . 

We now turn to reformulate the combat game to avoid the technical difficulties 
referred to above. To this end we introduce the concept of a 6-safety margin and a 
6 -combat game . 

Let 6 be a positive number and let (6): = S^(^ u n S^) denote the open 

6-neighborhood of & n & that is, the set U o? all points <t>: = (t,x)e^e n+1 whose 
(Euclidean) distance 0 d(<t>V u n S ) from n is less than 6. Let 

S' (6): = S /^ ( 6 ) denote the points of S u that are not in ^ uAv ( fi )* Similarly 

define 

The 6-combat starts at time t = t Q and continues as long as 

[ t , x( t) ] £ ir\t[g u ^ uAv ( 6 )] • 

We shall say that combat terminates at time t = t where 

t: = inf {t > t Q | [ t , x ( t ) ] G int[^ u ^ uAv ( 5 )]} • O 6 ) 

The winning conditions of the 6-combat differ, however, from those of the "ordinary" 
combat as follows. 

We say that the 6-combat ends in a 5-draw if t = T* - 6. However, if 
t < T* - 6 we distinguish between the following outcomes. We say that player u 
wins if 

[t,x(t)] e ^ u (6) , (17) 

that v wins if 

[t,x(t)] e S V (S) , (18) 

and that the combat ends in joint 6-capture if both players win. 

The 6-combat, similarly to ordinary combat, can end ( 1 ) by u winning, 

(2) by v winning, (3) in a 6-draw, or (4) in joint 6-capture. The players' 
termination preferences are now defined as follows: Player u ranks his prefer- 
ences in order of priority as (1) u wins, (2) a 6-draw or joint 6-capture, and 

(3) v wins. Player v ranks his preferences in reversed order. As discussed in 
some more detail below, no prior preference distinction is made between 6-draw and 
joint 6-capture. 



Given the above termination preferences it is clear that for an initial event 
(t 0 ,x 0 ), one of the players, say player u, can win the 6-combat game if, and only 
if, he can select a strategy with which he will win the game against every possible 
strategy of his opponent. We shall then say that (t 0 ,x Q ) is in u's 6-winning 
zone, denoted $ u (6). Similarly we can define $ y (6), the 6-winning zone of v. 

The fundamental difference between the ordinary combat problem and the 
6-combat problem is that in the latter case the winning terminal sets & (6) and 
#" v (6) are closed and 3^(6) n ^ (6) = <t>. Thus u winning and v winning cannot 
occur simultaneously, and in formulating the 6-combat game we have to proceed 
somewhat differently than for an ordinary combat problem. 


To obtain optimal strategies, we associate with the 6-combat problem a pair of 
cost functionals J u , the cost for player u, and J y , the cost for player v. 

These are defined by 


and 


where 


J u = S u [x(t),t] 


J v = g y [x(t),t] 



h u [t,x(t),u(t),v(t)]dt 


h y [t,x(t),u(t),v(t)]dt 


g u , g v , h u , and h y are suitably defined smooth real functions. 


(19) 


( 20 ) 


If the initial event [t 0 ,x Q ] e $ u (6), that is, player u has winning strate- 
gies, he will choose his strategy to minimize J u subject to the termination con- 
straint (17) and to the event constraint 


[ t , x( t) ] g int[^ uAv (6) U V t Q < t < t . 

Player v, having no feasible (winning) strategies, will play to maximize player 
u's cost functional J u . The resultant game is a zero-sum event-constrained dif- 
ferential game with player u minimizing, and player v maximizing, the cost func- 
tional J u . The responsibility of satisfying the event constraint rests with 
player u, while v tries to violate it. 

Conversely, if [t 0 ,x Q ] e <s y (6), the roles of players u and v are reversed. 

The relevant cost functional becomes (20) and the event constraint is 


[ t , x( t ) ] g int[^ uAy (6) U .T u ] V t Q < t < t . 

There remains the case when the initial event [t 0 ,x Q ] is neither in ® u (6) nor 
in 9 V ( 6 ) . Clearly no winning strategies exist for either player in this case (in 
respect to the 6-combat game). However, it is still possible that for some smaller 
6 > 0, (t Q ,x o ) e $ u (6) u $ y (6). In that case winning strategies can be selected for 
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the players just as before but in respect to the 6-game instead of the 6-game. 

If, on the other hand, ( t Q , x Q ) £ $ u (6') u $ v (6') for all 6* > 0, then it follows 
that (t 0 ,x Q ) e $ uVv u $ uAv where $ uVv is the draw region and « uAv is the joint 
capture region of the game as defined earlier. In this case we adopt the preference 
ordering between a draw and a joint capture as established earlier for (ordinary) 
combat games; that is, the players prefer a draw to joint capture. In either case 
the players choose their strategies so as to minimize their own cost functionals 
resulting in a nonzero-sum game. 


5. EXAMPLE OF A COMBAT GAME: THE TURRET GAME 


Formulation 

To illustrate the theory just developed we consider a combat game (the turret 
game) that represents a simplified version of the air combat situation discussed 
qualitatively in section 2. 

Player u moves in a plane with arbitrary velocity relative to a fixed refer- 
ence frame (X,Y), and can turn a ray weapon relative to a fixed direction at a 
bounded angular rate a (see fig. 2). Player v moves so that he is always at a 
distance R from u, and he can traverse this circle at an angular speed relative 
to a fixed direction at a bounded rate 6. Player v also has a ray weapon that he 
can turn relative to the line of sight between the two players at a bounded rate 4>. 



► X 

Figure 2.- Turret game in fixed reference frame. 
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For convenience we represent the problem in a relative reference frame with 
origin at u's position and the. y-axis along u's weapon (fig. 3). Letting 

X| = 0- a, x 2 = <|>, u = a, v.=g, and v = -<t>, the kinematical equations of motion 
are 


X 

H 

< 

1 

c 

x-jiO) = xo 

(21) 

X 2 = ~ V 2 

x 2 (0) = xo 

(22) 


with Xj and x 2 being computed modulo 2ir, and where we take t = 0 since the 
system is autonomous. In view of the circular symmetry of the problem, it is easily 
seen that the playing space of interest is 

P = {(x,|,x 2 )|x^ e [0,ir],x 2 e (23) 



Figure 3.- Turret game in relative reference frame. 

The admissible controls are specified within the bounds 

0 < u < u (24) 
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and 


(v r v 2 ) e , (25) 

2 2 
where & + is the positive quadrant of 9t . 

Next, we choose T* , the maximum allowed time duration of the combat. The 
extended targets are then given by 

& - {(t,x 1 ,x 2 ) e P | either (x^ < e 1 and t < T#) or (t > T*)} 

and 

= {(t,x 1 ,x 2 ) G 91* P|either (x 2 < e 2 and t < T#) or (t > T*)} 

where e-|R and e 2 R are the radii of the vulnerability regions_or capture sets of 
the two players (fig. 3). The extended joint capture region 9^ n is given by 

9 n = {(t,x 1? x 2 ) G 9t* P|either (x 1 < and x 2 < e 2 , if t < T#) 

or (t > T*)} . 

In the ensuing discussion we shall, for the most part, assume that T* is 
sufficiently large so that we can ignore the t dimension of the target. Thus we 
shall refer to the (ordinary) target sets (see fig. 4) 

^u = ^ (x l’ x 2 ) e p l x i ~ e 1^ 

= {(x r x 2 ) e P | x 2 < e 2 } 

9^ n = {(x 1 ,x 2 ) G P|x 1 < e r x 2 < e 2 } . 

Upon specifying the safety margin 6 >0, we obtain the 6-winning zones as 
depicted in figure 5. The 6-combat terminates at time t < T# - 6 with one of the 
players winning if t is the first time the state intercepts the set #" u (6) u ^ v (6) 
with an inward velocity. _ The 6-capture of v by u occurs (if it occurs at all) 
if at termination with t < T* - 6, 

w. j ( t ) = e 1 ,x 2 (t) > e 2 + 6 and x^t) < 0 . (26) 

Similarly, 6-capture of u by v occurs if at termination with t < T* - 6, 

x , | ( t ) > e 1 + 6,x 2 (t) = e 2 and x 2 (t) < 0 . (27) 

Alternatively, combat terminates either in joint 6-capture or in a 6-draw. 

Next we will analyze this game for different cases (linear and circular control 
constraints) and for different cost functionals (quadratic and time-optimal). 
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Linear Control Constraint 


Let V (fig. 6) be given by 


v 1 > 0 , v 2 > 0 , 


V 1 v 2 

Z 1 + ~ ^ 1 , 

V 1 V 2 


where v. and are preselected positive bounds. (This case may be viewed as a 
convexification and approximation of the typical situation in which v's motion is 


limited by a lateral acceleration constraint specified by v^Vg < k^ and by 
bounds v 1 < k 2 and v 2 < k^, the dashed lines on figure 6). 



Figure 6.- Linear control constraint. 


The cost functionals are chosen as 



( 29 ) 
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and 



where C 1 and C 2 are positive constants. These cost functionals reflect the com- 
bined (weighted) objective of the winning player, that of minimizing the termination 
time while securing maximum safety (i.e., the maximum final distance from the oppo- 
nent's target set), and the converse objective for the losing player. 

beginning the detailed analysis of optimal strategies, we examine the 
implications of the termination conditions (26) and (27). First, note that v can 
always win from suitable initial conditions because, from equations (22) and (28), 
he can always satisfy the third condition of equation (27). On the other hand, 
for_ u to win he must be able to force the third condition of equation (26), 
x i ( t ) < 0; this implies, using (21), (24), and (28), that 

“ > v 1 (31) 

must hold. Thus, the relative magnitudes of u and v. are of key importance and we 
begin by studying the game with (3D holding. 

First, consider u’s winning fi-game. In this game u wishes to minimize 
and v to maximize J u in equation (29). In order to employ the standard neces- 
sary conditions, the Hamiltonian is defined by 


H = X q C 2 + X 1 (v 1 - u) - X 2 v 2 . (32) 

We may set X Q = 1 (since X Q = 0 adds no new candidates for optimal control) and 
X i and X 2 are constants (because H does not depend on x 1 or x 2 ). If u#, v*, 
and v* are optimal controls, then 

u*,v*,v* = arg( min_ max H) , (33) 

0<u<u v^,v 2 eV 

and 

H(x 1 ,X 2 ,u*,v*,v») = 0 . (34) 


The termination condition is 


X 1 = ^(t) = e., . (35) 

The state constraint is x(t) £ int[^ v u ^ uAv ^ 6 ^ v t e [0,t], or 
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pc 2 > e 2 + 6, x 1 < e y 



< X, < £. 


V t G CO, t] 


From equation (29), the transversal ity conditions give X 2 as 

X 2 = = 9J u /3x 2 = ~ 2 C i X 2^) < 0 • 

Therefore, we may write equation (32) as 


(36) 


(37) 


H = C 2 + X 1 (v 1 - u) + 2C^x 2 v 2 , (38) 

where x 2 = x 2 (t). 

To determine the optimal controls from equations (33) and (34), the sign of 
X 1 is needed. If x 1 < 0, then from equations (33) and (38) v* = 0, v# = v 2 , 
and u# = 0; therefore, H is the sum of two positive terms, violating condi- 
tion (34). Similarly, = 0 leads to violation of (34) and thus 


X 1 > 0 . (39) 

It follows that the optimal control for u is 

u* = u (40) 


and the optimal controls for v are 

v* vX = arg[ max (>- 1 v 1 + 2C 1 x 2 v 2 )] . (41) 

V r V 2 GV 

The optimal solution of this simple linear programming problem will always lie on 
the constraint 


v 


2 



0 < v 1 < v 1 


so that equation (41) becomes 

v* = arg[ max_ (X. - 2C.x 2 v 2 /v )vj . 
0<v 1 <v 1 “ 41 


(42) 


(43) 
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There are three possibilities: 


X 

X 

X 


1 

1 

1 


> 2C 1 x 2 v 2 /v 1 

v * = v 1 , V* = 0 , 

(44a) 

< 2C 1 x 2 v 2 /v 1 

4 

< 

II 

o 

< 

ro* 

ii 

r^ 1 

(44b) 

= 2C^x 2 v 2 /v 1 

=> v|,v^ singular . 

(44c) 


Next, consider v's winning 6-game with condition (31) holding. Now, u 
wishes to maximize and v to minimize equation (30) subject to the termination 
condition 


x 2 = x 2 (t) = e 2 


and the state constraint 


(45) 


x 1 > e 1 + 6, x 2 < e 2 


> e, + /? - 


X 1 - £ ^ + /6" - (x 2 - e 2 ) , 
x^ > , e 2 + 6 < x 2 


e 2 < X 2 < e 2 + 6 f V 1 E 


(46) 


Proceeding as before, we conclude that X^ = -20^., u* = u, X 2 > 0, and v's 
possible optimal controls are as given by (44a), (44b), and (44c); however, in this 
case only the choice (44b) satisfies (34). 


We are now in a position to delineate the winning regions $ u (6) (u wins) and 
# v (<$) (v wins). Player u's winning 6-game will be feasible if, and only if, he 
is able to satisfy equations (35) and (36) for all of player v's admissible con- 
trols. At the boundary of $ u (6), v will just be able to make (36) an equality 
with controls (44b). With this choice of controls, (35) and (36) give 


that is, 



x 2 - e 2 ~ 6 
X 1 - e 1 ' 


2 

- = :y 2 
u 


(47) 


which defines $ u (6). 

Similarly, v's winning 6-game will be feasible if, and only if, (45) and (46) 
are satisfied. At the boundary of $ v (6), v can just achieve (45) with (46) an 
equality at t = t and his controls thus will be (44b) here. Integrating (21) and 
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(22) with this control choice and using (46), the specification of the region $ v (6) 
is obtained as 


*2 - e 2 
X 1 ' e 1 - 


* Y' 


(48) 


The curves defined by equalities in (47) and (48) divide the playing space into 
regions of different outcomes and, therefore, following Isaacs (ref. 2), may be 
termed 6-barriers. 

We now determine v's optimal controls in region $ u (6). First, suppose (44a) 
holds; substituting v# = and v* = 0 in (38) and invoking (34) gives 


Putting (49) in (44a) then gives 


2C iY2 u( 1 - Yl ) 

where Y ^: = v^u. From (22) the extremal trajectories in this case are parallel to 
the x^-axis and this, together with (50), defines the region $ (6) in 

figure 7(a). Next suppose (44b) holds; substituting v* = 0 and v* = v 2 in (38) 
and invoking (34) results in 


u - v . 


C 2 Y 1 


(49) 


I C A \ 


X 


1 


C 2 + 2C^x 2 v 2 
u 


(51) 


and putting this in (44b) then gives 


x 


2 


C 2 y 1 


2CiY 2 u( 1 - Yl ) 


(52) 


In this case, from (21) and (22) the trajectories are parallel to the bounding line 
of $ u ( 6 ) as defined by (47) with equality; this, along with (52), defines 

</ 2 ^(6). The last possibility is (44c) for which the controls need satisfy only 


(42), and 


2C 1 X 2 V 2 


(53) 
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Substituting (53) into (38) and using (34) and (42), we arrive at 


X 2 = 


C 2 y 1 


2C 1 y 2 u(1 - ) 


2 * 


( 54 ) 


Therefore, all optimal trajectories in the remaining region, $ ^ (6), terminate at 
( e 1 »x 2 ). Because of the linearity of (21), (22), and (28), a¥l paths using any 
sequence of controls satisfying (42) and reaching (e^^) without violating the 
constraint will take the same time and hence have the same cost; thus the optimal 
controls and paths are not unique in this case. The region $ u (6) is then given by 


$ ( 6 ) = 

u v 


3 

u 

i = 1 




We can now completely specify the optimal strategies in the winning regions 
$ (6) and <f v U) for the two players for the case (31). In *'(«), v plays 
until termination and in ; (6) he plays (44b) until termination. In 
(s), v plays any sequence satisfying (42) giving termination at (ex). In 
the region $ V (<S), v plays (44b). Player u plays u in all winning regions. 
The partition of the playing space is described in figure 7(a) for the case 


2C lY2 u(1 - Yl ) 

/ 1 \ 

Player v's winning region is denoted '(6) in this figure for reasons which 
will become apparent subsequently. This figure also shows example optimal trajec- 
tories in each winning region. 

For initial states satisfying neither (47) nor (48) [i.e., for 
(x»,x£) £ « u (6) u $ v (6)], but satisfying one of these conditions for some 6 < 6, 
the preceding analysis applies with 6 replacing 6. 

It remains to resolve the combat for initial states satisfying neither (47) nor 
(48) for all positive 6 no matter how small. These states lie on a line of 
slope y 2 passing through the point (e.,,e 2 ), given by the dashed line in 
figure 7(a), and for points on this line we must have (x<j,xQ) e <p u $ 

Further, since under condition (31) there can never be a draw (u will always 
eventually win with u* = u unless v does so first), we have in fact joint 
capture, or (xO,x°) e $ uAv - The unique strategies that give this outcome are (40) 
and (44b) and any unilateral deviation from these strategies will result in the 
capture of the deviating player. The locus of points (x 1 , x 2 ) e <& uAv is therefore a 
"barrier" in Isaac's terminology (ref. 2), and could have been determined as the 
semipermeable surface emanating from the intersection of 9" u and 
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(a) y i < 1. 

Figure 7.- Regions and optimal trajectories in the playing space; turret 
game with linear control constraint and quadratic cost. 

Next, we consider the case 

u = v.j . (55) 

In this case, v can always prevent the third condition of (26) from being satisfied 
and thus u can never win (from initial states outside the target's interior). 
Therefore, only v's winning game need be considered. Reasoning exactly as before, 

the region $^(6) is defined by (48) and the optimal controls in this region are 
(40) and (44bT . 
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X 2 



(b) y i = 1. 

Figure 7.- Continued. 

f I \ 

From initial states in the region complementary to $ v '(6), consisting of 
points (x<j, x°) satisfying ^ 


xO - e - 6 > y 2 ’ 

a simple calculation using (21), (22), (42), and (55), shows that neither player can 
win the game with a 6-margin. Hence, the game will end either in a 6-draw or in 
joint 6-capture. 

Letting 6 become vanishingly small, the boundary line of $^(6) converges 
to the dashed line emanating from the point (e 1? e 2 ) in figure 7(bY. All points 
below this line are thus in *^(6) for some 6 > 0. It is readily seen that 
from all points above the dashed line, both players can secure at least a draw; for 
example, by using strategies (40) and (44a) for all t < T* - 6, although draw 
strategies are nonunique for both players. Thus, points in this region are in 

\iVv 

From points on the dashed line u must play strategy (40), otherwise v will 
have winning strategies. On the other hand, if v uses strategies satisfying (42), 
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n 


x 2 

4 


ir 


X 1 



(c) y i > 1 . 

Figure 7.- Concluded. 

with u playing (40), the outcome will be either draw or joint capture. By the 
preference order established for ordinary combat, v chooses a draw. An example of 
a draw strategy for v is to play (44a) for all t < T* - 6, but draw strategies 
are not unique for v. 

The regions with (55) holding are shown in figure 7(b), along with example 
optimal trajectories. 

The last case to consider is 


u < v 1 . (56) 

From (21), (22), (26), and (27), it is obvious that now v can capture u from any 
position in the playing space P that is not in the interior of u's target, and 
therefore <& v (6) = P/int 0^(6). The necessary conditions again give (40) as u's 
optimal control and show that v's optimal controls will lie on (42). In this 
case, however, the necessary conditions do not establish the sign of X 2 ; further, 
the optimal controls of v are frequently nonunique, and the optimal trajectories 
are frequently on the boundaries of the playing space. This means that the neces- 
sary conditions are of little use in determining v's optimal controls and we use 
direct comparison of cost functionals instead. 


25 



The minimum time-to-capture from an arbitrary point (x°,x°) in the region (48) 
is obtained from using controls (44b) in (21) and (22); the result is 

t' = (x£ - e 2 )/v 2 . 


At this time, 


X 1 = - ^ X 2 " e 2^ /y 2 + X 1 


and thus from (30), 

cP = -C^xo - (x° - e 2 )/y 2 ] 2 + C 2 (x0 - e 2 )/v 2 . 


At t', v has the option of forcing penetration immediately or of playing controls 
(44a) and forcing penetration of his target at some later point x'^ at an addi- 
tional time increment t" . Again from (21), 

x” = (v 1 - u ) t" + x!j 

and the cost is 


J" = 
v 


-f y" 
° 1 X 1 


X 2 - 


(x° - e 2 )/y 2 


- X 1 


(57) 


But this function will have a minimum with respect to x" e [ x * ,n] at either 

x'j = x* or x" = it. Thus, we need to compare the cost (57) for x" = x' with that 

for x'j = it. The result is that the minimum-time path will be optimal if 


x<j + it 



C 1 u(y 1 - D 


(58) 


and the path ending at (it, 62) will be optimal when inequality (58) is reversed. 
Mote that because of the linearity of (21), (22), and (42), all paths using any 
sequence of controls satisfying (42) and reaching (tt,e 2 ) without violating the 
constraints will take the same time. Thus the optimal controls and paths are not 
unique in this case. 


The line separating the two regions (obtained by replacing the inequality by 
equality in (58)) has slope Y2 and is thus parallel to the minimum time paths. 

This surface intercepts v's target at C 2 /[C^u(y^ - 1)1 - it. 

The regions and example trajectories are shown in figure 7(c) for the case (56) 
an^ the condition + 6 < C 2 /[C^u(y^ - 1)] - it < it. The optimal paths in region 
( 6 ) are nonunique (only tne two extreme paths are shown), and all end at 
(e^ + 6, e^) in minimum time; the paths in '(6) are unique, and minimum time; 
and the paths in <5^ '(S) are nonunique, and all end at (it, e 2 ). If 
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C 2 /[C.|U(y.| - 1 ) ] - it < e. + 6, all optimal trajectories end at (it, e 2 ) and if 
C 2 /[C^u(y^ - 1)] > 2u all trajectories are minimum time. 

A special case of these results is time -optimality (C^ = 0 and C 2 = 1). In 
this case, the regions in which the optimal trajectories are not time-optimal vanish 

( o ) (1) (1) 

(specifically, ( 6 ), (6), and (6) in figs. 7(a), 7(b), and 7(c). Also, 

in region (48) u's control is not defined since it has no effect on the outcome of 
the game. These results are summarized in figures 8(a), 8(b), and 8(c); the optimal 
strategies in each region are apparent. 



(a) y i < 1. 

Figure 8.- Regions and optimal trajectories in the playing space; turret 
game with linear control constraint and time-optimality. 


Circular Control Constraint 
Now let V (fig. 9) be given by 

v 1 >0, v 2 > 0, v 2 + v 2 < v 2 , (59) 

where v is a preselected positive bound, and choose both costs as time-to-capture 
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Figure 9.- Circular control constraint. 



( 60 ) 


As before, inspection of ( 22 ) and (27) shows that v can always force termina- 
tion (i.e., x 2 (t) < 0) and is therefore always capable of winning; whereas from 
(21), (26), and (59), u can win only if 

u > v . (61) 


We begin by investigating the game with this condition holding and first consider 
u's winning game; that is, u minimizes and v maximizes (60) subject to (21), 
(22), (24), (35), (36), and (59)- The Hamiltonian is 


_2 2 2 

H = 1 + X.(v 1 - u) - X 2 v 2 + y( v - v 1 - v 2 ) 
where the (ordinary) multiplier y satisfies 



( 62 ) 


(63) 


and X 1 and X 2 are constants. The optimal controls for v must satisfy 
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( 64 ) 



Since u = 0 violates the condition H = 0, required for optimal controls, (63) 
implies that v's optimal controls satisfy 

2 2-2 

v 1 + v 2 = v* (65) 

Further, H = 0 and conditions (64) imply that 


X 1 > 0 and X 2 < 0 


so that 


( 66 ) 


u* = u . (67) 

If (36) is satisfied with strict inequality at t = t, the transversality conditions 
give X2 = 0, and from (64) and (65), for this case, 

v* = v, v* = 0 . (68) 

To determine the boundary of u's winning region, note that v desires to 
choose controls (subject to (65)) to make this region as small as possible. Assum- 
ing constant controls and using (65), (21) and (22) may be integrated to give 

e 1 = + (v 1 - u)t (69) 

and 

x 2 = x° - (v 2 - v 2 ) 1/2 t . (70) 

Evaluating (36) at t = t and using (69) and (70) gives 

. ( 71 ) 

U - V, 

p 

Therefore, u's winning region will be smallest when = y u, where y = v/u; 
putting this value in (71) gives the specification of region $ u (s) as 

X o - e 2 - 6 > ( X o - El )/(1/y 2 - 1) 1/2 . (72) 

The optimal strategies in this region are (67) and (68). 
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Next, consider v's winning 6-game for (61) holding; now u maximizes and 
v minimizes (60), subject to (59). The Hamiltonian is (62) as before, but now 
y < o in (63). Proceeding as before, if (46) is satisfied with strict inequality 
Xi =0, then v's optimal controls are 


v 


* 

1 


= 0 , 



( 73 ) 


and u's control is indeterminate. For (46) satisfied with equality at t = t, 
u's optimal control is given by (67) and v's optimal controls are constants 
satisfying (65). Integrating (21) and (22) with these controls gives 



t 

6 


/ „ . ,/-2 2 . 1/2 
(xo - e 2 )/(v - V 1 ) 

(v 1 - u)(x° - e 2 )/(v 2 - 



(74) 


Solving the latter equation for v.|, 


. r 2,. 2x ,,1/2 

v _ r 1 - n[y (1 + n ) - li 

V 1 ‘ ,2 

1 + n 


(75) 


where 


n 





(76) 


This control will be used in the region in which (75) will have real solutions 
satisfying (65); that is, 

(1/y 2 - D 1/2 * n < 1 /y • (77) 

Note that the controls corresponding to the lower and upper bounds in (77) are 
v^ = y 2 u and v^ = 0, respectively. 

We have now determined the regions and optimal controls for the case (61). 

In $ u (6), defined by (72), the optimal controls are (67) and (68). In $ v (6), 
defined by 


n > 1/y i (78) 

(2) 

the controls are (73) with u indeterminate. In (6), defined by (77), the 

optimal controls are given by (65), (67), and (75). 

To resolve the combat in the remaining region, that is, the region which satis- 
fies neither (72) nor (1/y 2 - 1 ) ' 2 < n, we proceed as in the linear constraint 
case. Initial states not satisfying these inequalities for all positive 6, no 
matter how small, lie on a line of slope (1/y 2 - I)" 1 passing through (e-j, e 2 ). 
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This line, the dashed line in figure 10(a), is a locus of joint capture initial 
states for which the unique controls are (67), v* = y u, and v* = yu(1 - yt'*' 
The various regions and example optimal trajectories are shown m figure 10(a). 


® x8- — + eo + 8 

2 7^7 


© x o 
2 



Figure 10.- Regions and optimal trajectories in the playing space; turret 
game with circular control constraint and time-optimality. 

The cases u = v and u < v may be easily inferred from the results for the 
case (61). As y -*• 1 from below, the slopes of the boundary lines labeled (1) 
and (2) in figure 10(a) become infinite. Therefore, $ u (6) vanishes and the upper 
boundary of $ v (6) now depends on the prespecified value of T*w To find this., 
boundary, we integrate (21) and (22) using (65) and (67) backward from (e^ + 6 , z^) 
ov|r an interval of time T* - 5. The result is that initial conditions in region 
^(6) must satisfy 
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(79) 


(x° - e 2 ) 2 < (x° - e 1 - «)[2u(T* - 6 ) - x° + e 1 + 6 ] . 

(i\ (p) 

Regions (6) and 9^ ' (6) are shown in figure 10(b), and the controls in these 

regions are as previously but with y = 1 . 

From initial states in the region complementary to (79), neither player can win 
the 6-game. Letting 6 become vanishingly small, the boundary line associated 
with (79) becomes the dashed line shown in figure 10(b). Points below this line are 

in 9^\l) for some 6 > 0, and from points above this line both players can 
secure a draw. As before, draw strategies are not unique. From initial states on 
the boundary line itself, v has the option of draw or joint capture at t = T*; by 
the preference order for ordinary combat, he chooses draw. 

Finally, for u < v (fig. 10(c), $ v (6) is the entire playing space, again 
composed of regions $^(6) and 9^(&). 

6. DISCUSSION 


Although the turret game is a very simple and idealized problem, analysis of 
this game has revealed a rich variety of combat phenomena. First, note that the 
solution to this game exhibits many features commonly found in differential games, 
such as the existence of barriers and "singular” surfaces. An example of a singular 
surface is given by (58) with the inequality being replaced by equality; this is a 
singular surface of type (p, u, -) in Isaacs' terminology (ref. 2). 

The optimal strategies also exhibit features common in differential game solu- 
tions. In most regions of the state/parameter space, the optimal controls of both 
players are unique and constant. There are regions, however, in which the controls 
(o\ ( 2 ) (7) 

are nonunique (<J^ (6) in fig. 7(a), and 9^ (6) and (6) in fig. 7(c)). We 

note that in many cases the optimal strategies are obvious; for example, from ini- 
tial conditions in the region 9^ '(6) in figure 7, player v can capture u 
(before being himself captured) by simply "standing" and turning his turret at the 
maximum rate. 

The turret game solution, however, shows that combat problems have features not 
encountered in games of survival and pursuit-evasion. One of these features is the 
existence of a manifold on which both players are locked into mutual destruction at 
the earliest possible time, in the sense that any deviation from this policy by one 
of the players will result in his unilateral capture by the other. This situation 
is mentioned in reference 7, and has been found to occur also in nonoptimal air 
combat simulations (ref. 27). Another feature is a manifold on which one of the 
players has the unilateral choice between a draw and mutual destruction at a time of 
his choosing. 

The idea of 6-combat games, introduced to solve technical problems concerned 
with closure properties of the target sets, also has important practical 
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implications. In the turret game, 5 is closest that the winning player is allowed 
to approach his opponent's target at termination. Thus, the winning player can 
choose 6 to specify to what degree he is willing to accept the risk of his own 
capture. 


Because of past emphasis on pursuit-evasion problems, it is of interest to 
examine the turret game from a pursuit-evasion standpoint. First consider the 
(time -optimal) pursuit-evasion game with u as the pursuer and v as the evader 
(u/v) subject to (24) and (28). The necessary conditions give the optimal strate- 
gies as u* = u, v^ = v^, and v* = 0 for v^ < u, and capture occurs if 
v^ < u. For v/u, the optimal controls are v# = 0, v* = and u# undefined, 
and capture always occurs. 


Now suppose we (naively) attempt to construct the combat results from the 
pursuit-evasion results by assuming that whichever pursuit-evasion game ends in the 
least time will be_the one played. Then, for v^ > u, the v/u game will be played 
everywhere. For v^ < u, the times of the two games must be compared. Integrating 
(21) and (22) with the two sets of controls shows that the u/v game will be played 
if 


x i - 

xo - . 



( 80 ) 


and conversely for the v/u game. These results are shown in the playing space in 
figure 11 for 6 vanishingly small. Note that the slope of the boundary line (80) 
is greater than the slope of the boundary lines (47) and (48). 

Comparing figures 8 and 11, we see that in the regions (*) on figure 11 the two 
analyses give the same solutions, but that in the other regions the solutions are 
dramatically different. In all these other regions, the pursuit-evasion solution 
indicates that v will win, v's optimal strategy is (0, v^), and u's strategy is 
immaterial. In region (1) on figure 11(a), however, the combat game results show 
that if u plays u, then u will win. Moreover, u will win in minimum time, 
if v persists in playing his pursuit-evasion-derived strategy. In (2) on 
figure 11(b), the combat results show that the best v can achieve is a draw, and 
that he must play (v., 0) to do this; if v plays his pursuit-evasion strategy, u 
will win. And in (3) on figure 11(c), v can in fact win but he must recognize and 
avoid u's target to do so. 

Thus from v's standpoint, the pursuit-evasion results frequently tell him he 
can win when he cannot. Moreover, use of the pursuit-evasion strategies frequently 
will cause v to be captured when capture is avoidable, or lead him to be captured 
in minimum time when capture cannot be avoided. From u's standpoint, the pursuit- 
evasion results frequently tell him that he will be captured and that his strategy 
selection is of no consequence, when in fact he has winning or draw strategies. 

Thus, the serious fallacy of using pursuit-evasion methods to "solve" combat prob- 
lems (i.e., differential games between opponents with offensive capabilities and 
offensive objectives) is clear. 


35 









As a final point in our discussion, we wish to reemphasize the role of threat 
for optimal strategy selection in suitably formulated combat games. We have clearly 
seen above that both players combine offensive and defensive behavior in their 
optimal strategies. The winning player, during his offense, takes defensive mea- 
sures to avoid being captured himself. At the same time, the losing player, usually 
thought of as defensive, applies a threat to the winner. That is, he also imple- 
ments his offensive capability in order to prevent his opponent from using the most 
damaging strategies (in terms of the formulated game's cost). In a properly formu- 
lated combat game, just as in actual combat, both players combine a suitable blend 
of offensive and defensive maneuvering. 


Ames Research Center 

National Aeronautics and Space Administration 

Moffett Field, California 94035, December 20, 1984 
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